SPARSE RECOVERY, KASHIN DECOMPOSITION AND CONIC PROGRAMMING 



ALEXANDRE D' ASPREMONT 



Abstract. We produce relaxation bounds on the diameter of arbitrary sections of the li ball in R". We use 
these results to test conditions for sparse recovery. 



1. Introduction 

Let A E R"^x" be a full rank matrix, we are given m observations Au of a signal u G R", and we seek 
to decode it by solving 

minimize Card(a;) 
subject to Ax = Au, 

in the variable x £ R". Problem (1) is combinatorially hard, but under certain conditions on the matrix A 
(see e.g. Donoho and Tanner (2005); Candes and Tao (2005); Kashin and Temlyakov (2007); Cohen et al. 
(2009)), we can reconstruct the signal by solving instead 

minimize ||x||i 
subject to Ax = Au, 

which is a convex problem in the variable x G R". 

2. Sparse recovery conditions 

We begin by discussing conditions on the coding matrix A G R™x" and on the signal u which guarantee 
that the solution to the £i minimization problem (2) matches that of the io minimization problem (1) and 
allows us to reconstruct the original signal u. 

2.1. Discrete signals. We first assume that the signal u only takes discrete values. For a given coding 
matrix A G R"^^", the proposition below describes a sufficient condition which guarantees that a discrete 
signal uG{— 1,0,1}" will be reconstructed by solving problem (2). 

Proposition 2.1. We define 

^/=|nGR": n'^x + C^|uiXi| < Clk||i,Vx G R" : Ax = o| . (3) 

If u G { — 1,0, 1}" n U for some ^ G (0, 1) and z G R" solves the ii recovery problem in (2), then the 
signature of z is a subset of that ofu, i.e. UiZi = \zi\, i = 1, . . . ,n. 

Proof. Suppose there is a vector z G R", with Az = Au and ||z||i < ||n||i. Let / = {i G [1, n] : Ui ^ 0} 
be the support of the signal u and J its complement in [1, n], the vector u — z is in the nuUspace of A so 
u gU implies 

n 

U^{u - z) +^^1^411^^ - Zi\ < ^\\u - z\\i. 
i=l 

Because Ui G {—1, 0, 1} this is equivalent to 

u^{u - z) < i\\zj\\i, 
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hence, having assumed ||2:||i < ||u||i,we get 

Iklli = ll^/lli + Ikilli < ll^lli < z + CWzjWi 

< +^lkj||i, 

so II^jIIi = 0. Then ||z||i < < u^z < \\z\\i means UiZi = \zi\, i = 1, . . . ,n. u 

Given a priori bounds on the signal coefficients, we obtain the following (tighter) result, which ensures 
that the signature of the decoded signal matches that of the true one, when solving a modified version of 
problem (2). 

Corollary 2.2. Let z solve 

minimize \\x\\i 

subject to Ax = Au (4) 

ll-^lloo — 1; 

Ifu G { — 1, 0, 1}" n Ufor some ^ £ (0, 1), where U was defined in (3), then z = u. 

Proof. In the proof of Proposition 2.1, we showed z = \\z\\i = \\u\\i under the same assumptions, 
which together with the additional constraint that || 

-^lloo ^ 1 means that z — i 

Next, we show that controlling the ratio of dual pairs of norms on the nuUspace of A provides simple 
sufficient conditions for checking that a signal u belongs to the set U of -recoverable signals. 

Proposition 2.3. Let \\ ■ \\ be a norm on R" and \\ • ||* its dual, A G R™-><" and u G {—1,0, 1}", if 

sup ||x|| < -rr—rr- (5) 

Ax=o, IfII* 

||x||i<l 

then u £U, where U is the set of li-recoverable signals defined in (3). 
Proof. When (5) holds 

, ,r n n ^ 

sup |n| |x| < sup ||x|| < 



Ax=0 Ax=0, 1 + 

ll^l|i<l l|2:||i<l 



for some ^ G (0; l)^ where |ii| is the vector with components \ui\. We then have 

sup + ^ diag(i(;))x < (1 + ^) sup |n|"^|x| < ^ 

Ax=0,||x||i<l Ax=0 
«;£{-!, 1}" lla;||l<l 

which means u GU.m 

We can bound the value of the sup in (5) when || • || is the Euclidean norm and the matrix A satisfies the 
restricted isometry property of order k* with constant 5 < I. 

Lemma 2.4. Suppose A G R*"^" satisfies the Restricted Isometry Property (RIP) of order 3k* with con- 
stant 63k* < 5 < 1, then 

sup 7^ 77- < 7=. (6) 

Ax=olklli (1-(5)VA;* 

Proof. We roughly follow the proof of Cohen et al. (2009, Lemma 4. 1). Let 77 G R" be in the nullspace 
of A. Let T = To be the index set of the k* largest magnitude coefficients in rj, with Ti corresponding to 
the next k* largest and so on. Cohen et al. (2009, Lemma 4.1) show 

. {1 + 6) ||r?r'=||i 



(l-S) Vk* 
and 
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so 

II II ^ ll^lll 

and a triangular inequality yields the desired result. ■ 

As discussed in (Donoho, 2006; Kashin and Temlyakov, 2007) this result is in fact a direct consequence 
of classical bounds on Gel'fand and Kolmogorov widths, with Kashin (1977); Garnaev and Gluskin (1984) 
showing in particular that 



sup 



< 



Ax=o \\x\\i Vn 

for some matrices A (the proof is not constructive). Moreover, Kashin (1977) shows that this holds with high 
probability when the nuUspace of A is picked at random uniformly on the Grassman manifold of subspaces 
of R" with dimension k < n/2. In other words, when n is large, most matrices are good sensing matrices. 

2.2. Generic signals. Very similar results hold for arbitrary signals u G R" at marginally lower thresholds. 
In particular, Kashin and Temlyakov (2007, Th. 2.1) show the following guarantee. 



Proposition 2,5. Given a coding matrix A G R™^"_ suppose that there is some 5 > such that 

\\xh / 1 
sup < —= 

Ax=0 \\x\\l VS 

then x^^ = u //'Card(u) < S/A, and 



sup ^f-^ < (7) 



llu — x^^||i< 4 min ll'U — 

{Card(j/)<5/16} 

where x^^ solves the i\-recovery problem in (2) and u is the original signal. 

This means that the £i -minimization problem in (2) will recover exactly all sparse signals u satisfying 
Card(u) < S/A and that the £i reconstruction error for other signals will be at most four times larger than 
the ii error corresponding to the best possible approximation of n by a signal of cardinality at most 5/16. 

3. Weak recovery conditions 

Similar conditions (with slightly better recovery thresholds) can be derived when the signal u follows a 
given distribution, and recovery is only required to occur with high probability. Given k € [0, n], suppose 
now that the signal is i.i.d., distributed as follows 

{—1 with probability A;/2n 
+1 with probability A;/2n (8) 
otherwise, i = 1, . . . ,n. 

The condition defining U in (3) can be written 

max < u^x + £ \^ UiWiXi > < f 
^£{-1,1}" 1 trf J 

Ax=0,\\x\\i<l ^ ' 

and because the maximum is taken over a polyhedral set, this can be understood as 

rp 

max It X < ^ 

where T C R" is a finite set. When u is distributed as in (8), the left-hand side max^^T" uF x of this last 
condition is a Rademacher process whose mean and fluctuations can be controlled, as detailed in the lemma 
below. 
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Lemma 3.1. Let A G R™^"^ with ^ > and u distributed as in (8), define 



S{A) = max 



Ax=Q \\x\\i 

then the expected value of the max. can be bounded by 



(9) 



M{A) = E 



max iux 

«)€{-!, 1}" 
Aa;=0,||a:||i<l 



i=l 



<{l + i)S{A)^[\\uh] 



and 



Prob 



max <ux 

«>G{-1,1}" 
Aa;=0,||x||i<l ^ 



+ i^^uiWiXi \>i 

i=l ) 



(i-M(A)Y 

< 4e 4(i+o^s^(A) 



whenever M{A) < ^. 

Proof. First, remember that 



max u (I + £ diaLs(w))x < (1 + C) max \u\ \x\ 

ylx=0,||x||i<l a\ jj - \ ^^^^ 

«)£{-!, 1}" ll^:||l<l 

where \u\ is the vector with components \ui\ here. Then, a Cauchy-Schwarz inequahty yields 



E 



max (1 + f )Ki \x\ 

Ax=0 ^ ■'^i I I I 
||x||i<l 



< (1 + max ||x||2 E [11^112] 

Ax=0 
||x||i<l 



We then note that 



max max 11 (I + diag(?i;))x||2 = (1 + ||x||2, 
we{-l,l}" Ax=0 Ax=0 
||a;||i<l l|2;||i<l 

when ^ G (0, 1), and the concentration inequality follows from (Ledoux, 2005, Cor. 4.8). ■ 

We summarize these last results in the following proposition, which highlights the role played by S{A) 
in controlling the probability of recovering the signal u. 



Proposition 3.2. Suppose the signal u is distributed as in (8), f3 > and A G R™x" satisfies 
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S{A)< 



E[||n||2] + 2/3 + 4^^ 



(10) 



then 



Prob [u^U]< Ae-I^ 
where lA is the set of ii-recoverable signals defined in (3). 

Proof. If S{A) < l/(E[||u||2] + 2/3 + 4^), then there is a ^ G (0, 1) such that M{A) < ^ and 

(l + e)5(>l)(E[||n||2] + 2/3 + 4V^)<e, 
Lemma 3. 1 then yields the desired result. ■ 

Because E[||^i||2] < Vk (by Jensen's inequality), recovery with high probability can be obtained at 
slightly higher cardinalities k than those required for recovery of all signals. Of course, this discrepancy 
vanishes if the random model for u has uniformly distributed support of size exactly k. Here however, other 
choices of norm in (5) might produce different results. 

4. Tractable bounds 

In this section, we discuss methods to efficiently bound the ratio S{A), i.e. control the Banach-Mazur 
distance of £i and £2 on the nuUspace of A. 

4 



4.1. Semidefinite relaxation. We now show how to compute tractable bounds on the ratio 

S(A) = max || — p-, 
Ax=o \\x\\i 

defined in (9). This question is directly connected to the problem of efficiently testing Kashin decomposi- 
tions (see (Szarek, 2010, §4.1) for a discussion). We first formulate a semidefinite relaxation of this problem. 

Lemma 4.1. Let A G R™^", 

S(Af<SDP(A)= max Tr X (11) 

Tr(A^AX)=0 
Il^l|i<l,^h0 

where SDP{A) is computed by solving a semidefinite program in the variable X G S^- 
Proof. Writing X = xx^ , we have 

S{Af = max Tr X 

Tr(yl^AX)=0, \\X\\l<l, 
Rank(X)=l,XhO 

and dropping the rank constraint yields the desired result. ■ 

We now connect the value of S{A) with that of the function ai{A) defined in (Juditsky and Nemirovski, 
2008; d'Aspremont and El Ghaoui, 2008) as 

Q.iiA) = max -Ir-ir") (12) 

Ax=0 \\x\\i 

which can be computed by solving either a linear program (Juditsky and Nemirovski, 2008) or a semidefinite 
program (d'Aspremont and El Ghaoui, 2008). The following lemma bounds S{A) using ai {A). 

Lemma 4.2. Let ^4 G R™-^", we have 



ai{A) < SiA) < yJSDP{A) < ^/aliA) 

Proof. The first inequality simply follows from ||x||oo < ll^^lb^ the second from Lemma 4.1. If X 
solves (11), Tr{A^AX) = implies AX = 0, which means that the columns of X are in the nuUspace 
of A. Bydefinitionof ai(A), wethenhaveXjj = \\Xi\\oo < ai(^)||Xj||i, hence Tr(X) < ai(A)||X||i < 
ai{A), which yields the desired result. ■ 

The following proposition shows that if a matrix allows recovery of all signals of cardinality less than k*, 
then the SDP relaxation above will efficiently certify recovery of all signals up to cardinality 0{k* / y/n). 
This is a direct extension of Lemma 4.2 and Proposition 2.5. 

Proposition 4.3. Suppose A G r"^x" satisfies condition (J) for some S > 0, the semidefinite relaxation 
will satisfy 

S{A) < y/SDP{A) < S-^ (13) 
and the semidefinite relaxation will certify exact decoding of all signals of cardinality at most ^/S. 



Proof. From Lemma 4.2, we know that ai < S{A) hence SDP{A) < y^S(A). We conclude using 
Proposition 2.5. ■ 

We can produce a second proof of this last result, which uses the norm ratio in (9) directly. 

Proposition 4.4. Suppose A G R™^" satisfies condition (1) for some S > 0, the semidefinite relaxation 
will satisfy 

S{A) < ^/SDP{A) < S--i (14) 

and the semidefinite relaxation will certify exact decoding of all signals of cardinality at most ^/S. 
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Proof. If X solves the SDP relaxation in (11), with S{A) = 5 in (9), then the rows of X are in the 
nullspace of A, and satisfy ||Xj||2 < ll^i|/v^- Then, with 



" " IIXII 1 

TrX<Y,\\X,U<Y,\\X,h<\^< — 



,=1 ^ ~ y/S 



hence the desired result, i 



Note that we are not directly using X ^ in this last proof, so the semidefinite relaxation can be replaced 
by a linear programming bound 

LP{A)= max. Tr X 

s.t. AX = Q (15) 

||X||i < 1 

We now show that the S*"^/^ bound is typically the best we can hope for from the relaxation in (11). 

Proposition 4.5. Suppose A G R'^x" with n = 2m, then 

1 



< SDP{A) (16) 

and the semidefinite relaxation will certify exact decoding of all signals of cardinality at most 0{y/rn). 

Proof. Let Q be the orthoprojector on the nullspace of A. We have Q ^ 0, Tv{Q) = m, \\Q\\f = 
and HQ 111 < V^HQIIf < n^fra, which means that X = Q/{ny/m) is a feasible point of the SDP 
relaxation in (11) with TrX = y/m/n = l/\/2n which yields the required bound on the optimal value 
of (ll).B 

This means that if the matrix A allows exact recovery of signals with up to (an unknown number) S 
nonzero coefficients, then our relaxation will only exact certify recovery of signals with cardinality 0{^fS). 
The fact that approximating the recovery threshold S is hard is not entirely surprising, S in (7) is the 
Euclidean diameter of the centrally symmetric polytope {x G R" : Ax = 0, ||x||i < 1}. Computing 
the radius of convex polytopes is NP-Complete (Freund and Orlin, 1985; Lovasz and Simonovits, 1992; 
Gritzmann and Klee, 1993; Brieden et al., 2001). In particular, Lovasz and Simonovits (1992) show that if 
we only have access to an oracle for K, then there is no randomized polynomial time algorithm to compute 
the diameter of a convex body K within a factor v}^^. In that sense, the approximation ratio obtained above 
is optimal. Here of course, we have some additional structural information on the set K (it is a section of the 
£i ball) so there is a possibility that this bound could be improved. On the other hand, in the next section, 
we will see that if we are willing to add a few random experiments to A, then the diameter can be bounded 
with high probability by a randomized polynomial time algorithm. 

5. Geometric bounds 

Proposition 2.5 establishes a link between the sparse recovery threshold S" of a matrix A and the diameter 
of the polytope {x G R" : Ax = 0, ||x||i < 1}. In this section, we first recall some classical results of geo- 
metric functional analysis and use these to quantify the sparse recovery thresholds of arbitrary matrices A. 

5.1. Dvoretzky's theorem. We first recall some concentration results on the sphere as well as classical 
results in geometric functional analysis which control, in particular, the diameter of random sections of the 
£i ball (i.e. where A is chosen randomly). Let a be the unique rotation invariant probability measure on the 
unit sphere S"~^ of R", and || • ||i^ be a norm on R" with unit ball K, then 

a{xe S"-^ : |||x|| -M(i^)| > tM{K)] < g-'^^^)*' (17) 



with 



where c > is a universal constant, and 

M{K)= / \\x\\da{x) and h{K) = sup ||x||. (19) 

Klartag and Vershynin (2007) call k{K) the Dvoretzky dimension of the convex set K. Part of the proof 
of Dvoretzky's theorem states that random sections of K with dimension k = k{K) are approximately 
spherical with high probability (w.r.t. the uniform measure on the Grassman Qn.k)- We write the ip ball 
ofR". 

Theorem 5.1 (General Dvoretzky). Let E C be a subspace of dimension I < k{K) defined in (18), 
chosen uniformly at random w.r.t. to the Haar measure on Gn,k> then 

^(BJnE)c(A'n£)c^WnE) 

with probability 1 — e~'^^K where ci, C2, C3 > are absolute constants. 

Proof. See (Milman and Schechtman, 1986, §4) or (Vershynin, 2011, Th. 6.4) for example. ■ 

This result means that random sections of convex bodies with dimension k are approximately spherical 
with high probability. Milman and Schechtman (1997) show that the threshold k{K) is sharp in the sense 
that random sections of dimension greater than k{K) are typically not spherical. Because projections of 
sphere are spheres, there is thus a phase transition at k{K): random sections of K become increasingly 
spherical until they reach dimension k{K) below which they are approximately spherical with high proba- 
bility. 

The diameter follows this phase transition as well, and the following result characterizes its behavior as 
the dimension of the subspace decreases (we write K* the polar of K). 

Theorem 5.2 (Low M* estimate). Let E C R" be a subspace of codimension k chosen uniformly at 
random w.r.t. to the Haar f}l€ClSUT€ Ofl Qn.n—k:i thcfl 

diam(A'nS) < cJ-M{K*) 

with probability 1 — e~^, where c is an absolute constant. 

Proof. See (Pajor and Tomczak-Jaegermann, 1986) for example. ■ 



The value of M{K*) is known for many convex bodies, including Ip balls. In particular, (5")* = B'^ 
and M{B'^) ^J\ognJn asymptotically. This means that random sections of the li ball with dimension 
n — k have diameter bounded by 

diam(S{' nE)< c^J^^ 

with high probability, where c is an absolute constant (a more precise analysis allows the log term to be 
replaced by log{n/k)). 

Theorem 5.3 (Low M estimate). Let A G (0, 1) and k = [AnJ and E <zW^ be a subspace of codimension 
k chosen uniformly at random w.r.t. to the Haar 

M{K) > \/A 

then 

c\/l — A 
diam(A: n E) < ^ 

^ ^ " M{K) - VX 

with probability 1 — C2e~'^3'^^(^~^)", where 

M^{K) - A 
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1 - W^iK) 
1 



and ci , C2 , C3 are absolute constants. 



Proof. See (Giannopoulos et al., 2005, Th.B). ■ 



Note that the condition B2 C K means the set K needs to be normalized by b{K). Klartag (2004) 
recently produced a similar result using M{K) together with volume ratios. This result applies to all values 
of M{K)/b{K), unfortunately, the dependence on k is exponential instead of being polynomial. 

5.2. Connection with sparse recovery. We have seen in Proposition 2.5 that the sparse recovery threshold 
associated with the m linear observations stored in ^ G jjmxn^ ■ ^ largest signal cardinality for which 
all signals u can be recovered exactly by solving the £1 -minimization problem in (2), is given by the radius 
(or diameter) of the centrally symmetric convex polytope {x G R" : Ax = 0, ||x||i < 1}. By homogeneity, 
this is equivalent to producing lower bounds on over §"'™^^, the unit sphere of R"~"^. 

Proposition 2.5 (or Kashin and Temlyakov (2007)) shows that the sparse recovery threshold S of the 
observations A G R"^><" satisfies 

Q > i (20) 

- diamiix e R"" : Ax = 0, \\x\\i < 1})^ ^ ' 



The low M* estimate in Proposition 5.2 together with the fact that M{B^) ~ Y^log n/n then shows that 
choosing m linear samples A G Rmx" uniformly at random in the Grassman will allow us, with high 
probability, to recover all signals with at most ^^^^ nonzero coefficients, by solving the £1 minimization 
problem in (2) (again, the log term can be replaced by log(n/A;)). 

5.3. Approximating the diameter. As we have seen above, finding good compressed sensing experiments 
means finding matrices A G R™>^" for which ||Fy||i is almost spherical, where F is any basis for the 
nuUspace. Bad matrices are matrices for which the norm ball of ||-Fy||i is much closer to a cross-poly tope. 
This section is thus focused on measuring how spherical actually is. The key difficulty in high 

dimensions is that all centrally symmetric convex bodies look like spheres, except for a few "spikes" (or 
tentacles in Vershynin (2011)) with negligible volume, hence precisely characterizing the diameter using 
only probabilistic arguments is delicate. 

If we notice that ||Fy||i defines a norm on R"^*", we can try to apply Dvoretzky's result in the normed 
space (R"""*, ||Fy||i) instead of (R", ||x||i). The Dvoretzky dimension k{K) would then act as an indirect 
measure of how Euclidean ||-Fy||i is. In compressed sensing terms, k{K) computed in (R"^™, ||Fy||i) will 
measure how many random experiments need to be added to the matrix A so that all signals of size 0*{n) 
can be recovered exactly by solving the ^1 -minimization problem in (2). The low M estimate makes this 
statement even more explicit: Theorem 5.3 direcdy links the ratio M{K)/b{K) and the number (1 — A)n 
of random experiments that need to be added to reach recovery threshold S (through the diameter). 

5.3.1. Approximating the Dvoretzky dimension. We will see below that the quantities M{K) and h{K) 
which characterize the phase transition for sections of the norm ball of can be approximated ef- 

ficiently. We first recall a result which can be traced back at least to (Nesterov, 1998a; Steinberg and 
Nemirovski, 2005), approximating the mixed || • ||2-s-i operator norm by a MAXCUT type relaxation. 

Proposition 5.4. Let F G R"X"-™ then 

-SDP{F) < max \\Fx\\l < SDP{F) (21) 

TT l|a;||2<l 

where 

SDP{F) = max. Tr (XFF^) 

s.t. diag(X) = 1 (22) 
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Proof. We can write 

max ||-Fx||| = max u FF u 

||a;||2<l ll"l|oo<l 

and by convexity of u^FF^u this is equal to 

rp rp 

max u FF u 

ne{-l,l}" 

and Nesterov (1998b) (using again the fact that FF^ is positive semidefinite) shows that this problem can 
be approximated within a factor 2/7r by the semidefinite relaxation in (22). ■ 

This means that the mixed norm b{K), which is typically hard to bound in probabilistic arguments, is 
approximated within a factor 2/7r by solving a MAXCUT semidefinite relaxation when the norm ball is a 
section of the li ball. We now recall a classical result showing that the spherical average M{K) can be 
approximated by a Gaussian average. 

Lemma 5.5. Let f be a homogeneous function on R", then 

where a is the Haar measure on the sphere and g ~ A/'(0, 

Proof. Because the Gaussian measure 7 is invariant by rotation, uniqueness of the Haai" measure on §"^^ 
means that 

/ f{x)da{x) = Xn ||x||2/(2;/||x||2)(i7(2;) = A„ / f{x)dj{x) 
for some constant A„ satisfying 

An = / \\x\\2dj{x) 

and we conclude using 

R" T{n/2) 'iyjn 

as n goes to infinity. ■ 

We can now easily compute M{K), when K is the unit ball of with 

where Fi are the rows of the matrix F, with F £ ^nxn-m satisfying AF = 0. The key difficulty with 
these approximations of the Dvoretzky dimension is that M{Bf) is roughly equal to ^Injn, so the ratio 
M{K)/b{K) is already constant and the 2/ir approximation ratio for b{K) only produces trivial bounds. 
Hence, even though we can expect matrices with high approximate ratio M (K) / SDP{F) to be good sens- 
ing matrices, there are no guarantees that all such matrices will have high approximate ratios. 

5.3.2. Approximating M*{K). We can also use the low M* bound in Theorem 5.2 to produce bounds on 
the diameter. Once again, the idea here is to apply this bound in the normed space (R"~"^, instead 
of (R", ||x||i), i.e. measure how many random experiments need to be added to the matrix A so that all 
signals of size S can be recovered exactly by solving the ^1 -minimization problem in (2). Solving for the 
dual norm is a convex problem, hence we can simply approximate M* by simulation. In the particular case 
of (R"~"^, this means computing 

E [max||^j^||^<i y^g] = E [min^T^^^ lk||oo] = E [rahipT^^Q \\Fg + x||oo] (24) 

by duality, where g ~ AA(0, In-m) (and assuming F^F = I„_m)- Sampling both terms simply means 
solving one linear program per sample. Also, a simple Cauchy inequality shows that M{K*) is bounded 
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above by 0(1/ V^)- Since the target precision for our estimate of M{K*) is always larger than 1 / ^/n, this 
produces a recipe for a randomized polynomial time algorithm for estimating S. In fact, following (Bourgain 
et al., 1988; Giannopoulos and Milman, 1997; Giannopoulos et al., 2005), if C R" is a symmetric convex 
body, < (5, /? < 1 and we pick N points Xi uniformly at random on the sphere S"^^ with 

clog(2//3) 



N 



52 



where c is an absolute constant, then 



1 ^ 



+ 1 



< 6M{K*) 



i=l 



with probability I — f3. 
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